Automatic Generation of Titles for a Corpus of Questions

نویسندگان

  • Jesús Cardeñosa
  • Carolina Gallardo
چکیده

This paper describes the followed methodology to automatically generate titles for a corpus of questions that belong to sociological opinion polls. Titles for questions have a twofold function: (1) they are the input of user searches and (2) they inform about the whole contents of the question and possible answer options. Thus, generation of titles can be considered as a case of automatic summarization. However, the fact that summarization had to be performed over very short texts together with the aforementioned quality conditions imposed on new generated titles led the authors to follow knowledge-rich and domain-dependent strategies for summarization, disregarding the more frequent extractive techniques for summarization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Title Generation using EM

Our prototype automatic title generation system inspired by statistical machine-translation approaches [1] treats the document title like a translation of the document. Titles can be generated without extracting words from the document. A large corpus of documents with human-assigned titles is required for training title “translation” models. On an f1 evaluation score our approach outperformed ...

متن کامل

Automatic title generation for Chinese spoken documents using an adaptive k nearest-neighbor approach

The purpose of automatic title generation is to understand a document and to summarize it with only several but readable words or phrases. It is important for browsing and retrieving spoken documents, which may be automatically transcribed, but it will be much more helpful if given the titles indicating the content subjects of the documents. For title generation for Chinese language, additional...

متن کامل

Knowledge Extraction for Question Titling

This article describes the work carried out over the database of questions belonging to the different opinion polls carried out over the last 50 years in Spain. Approximately half of the questions are provided with a title while the other half is untitled. It is described the work and techniques implemented in order to automatically generate the titles for the corpus of untitled questions. The ...

متن کامل

Automatic Title Generation for Spoken Broadcast News

In this paper, we implemented a set of title generation methods using training set of 21190 news stories and evaluated them on an independent test corpus of 1006 broadcast news documents, comparing the results over manual transcription to the results over automatically recognized speech. We use both F1 and the average number of correct title words in the correct order as metric. Overall, the re...

متن کامل

Automatic Question Generation from Punjabi Text with Mcq Based on Hybrid Approach

Automatic question generation is an important area of Natural Language Processing that deals with the automatic generation of questions from the given sentence or paragraph in any Indian languages like Hindi, Punjabi, Marathi, Telugu, Gujarati, Urdu, Bengali, Malayalam, Kannada etc.,. This paper is presenting the research on automatic generation of questions from the given paragraph in Punjabi ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008